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Abstract 

of 

GWU/IRRA/TR-83/1 
8  April  1983 


ON  THE  DISTRIBUTION  OF  CYCLES  TO  CRACK  INITIATION 

by 

Refik  Soyer 


This  report  is  a  supplement  to  our  previous  report  on  the  dis¬ 
tribution  of  the  number  of  cycles  to  crack  initiation  in  aircraft 
engine  disks.  The  data  analyzed  in  the  previous  report  are  reconsidered 
here  using  a  new  approach.  Thus,  some  degree  of  familiarity  with  the 
first  report  is  assumed.  We  first  present  a  nonparametric  approach  for 
estimating  the  survival  (reliability)  function.  This  technique  is  ap¬ 
plied  to  the  actual  data,  and  comparisons  are  made  with  the  survival 
curves  that  are  obtained  assuming  the  lognormal,  the  Weibull,  and  the 
inverse  Gaussian  distributions.  An  alternative  informal  goodness  of  fit 
test  for  the  inverse  Gaussian  distribution  is  also  considered.  Our  con¬ 
clusion  is  that  estimated  survival  curves  using  the  lognormal,  the 
Weibull,  and  the  inverse  Gaussian  distributions  behave  similarly, 
whereas  the  survival  curve  based  on  a  nonparametric  approach  is  more 
conservative  as  compared  to  the  other  curves. 
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Institute  for  Reliability  and  Risk  Analysis 

ON  THE  DISTRIBUTION  OF  CYCLES  TO  CRACK  INITIATION 

by 

Refik  Soyer 

1.  OVERVIEW  OF  THE  APPROACH  USED 

In  our  first  report  [Soyer  (1982)],  the  problem  of  choosing  a 
distribution  for  the  number  of  cycles  to  crack  initiation  was  analyzed 
using  plotting  techniques  and  some  formal  goodness  of  fit  tests.  The 
techniques  for  the  Weibull,  the  lognormal,  and  the  inverse  Gaussian  dis¬ 
tributions  that  were  known  to  us  at  that  time  were  discussed  in  detail. 

In  this  report,  the  same  problem,  namely,  the  choice  of  distribu¬ 
tion,  is  considered,  but  by  using  a  different  approach.  A  new  method 
for  estimating  the  survival  function,  suggested  by  Proschan  (1982),  is 
applied  to  the  same  data.  The  estimated  survival  function  called  "non- 
parametric"  survival  function,  is  compared  with  the  survival  functions 
obtained  assuming  different  parametric  life  models. 

In  Section  3  an  alternate  informal  goodness  of  fit  test  for  the 
inverse  Gaussian  distribution  is  discussed  and  applied  to  the  data. 
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2.  THE  NONPARAMETRIC  APPROACH 


In  our  first  report,  three  life  models,  the  Weibull,  the  lognor¬ 
mal,  and  the  inverse  Gaussian,  were  analyzed. 

The  survival  functions  for  these  models  are  of  the  following  forms. 
The  Weibull  model 


F(x)  =  exp 


The  lognormal  model 


x  >  0  ,  6,6  >  0  . 


(2.1) 


F (x)  =  1  -  $ 


Hnx-ii 


J 


x  >  0  ,  y,a  >  0  , 


(2.2) 


where  $(•)  denotes  the  cumulative  distribution  function  of  the  stan¬ 
dard  normal. 

The  inverse  Gaussian  model 


F (x)  =  1  -  [(x/u)-l]}  +  e2Xlu  4>  (-  [l+(x/y)l]| 

or 

F(x)  =  $  (VaTx  (l-(x/y)]3  -  e2^^  $>  C“  A/x  [l+(x/p)0  .  (2.3) 

To  estimate  the  survival  function,  the  parameters  of  the  assumed 
distribution  are  estimated  from  the  data,  and  the  estimated  values 
plugged  in  to  the  above  equations. 

The  maximum  likelihood  estimates  of  the  parameters  of  the  models 
are  given  below. 

The  Weibull  Model.  Estimation  of  parameters  6  and  6  using  the 
maximum  likelihood  method  requires  the  solution  of  the  following  equa¬ 
tions  : 

N  ft 

0  =  l  1-6/6  +  (6/6) (x./6)c]  (2.4) 

i=l  1 

N  ft 

0  -  l  1(1/6)  +  in(x  /S)  -  (x  J&r  ln(x. /&)]  . 
i=l  ill 


(2.5) 


These  equations  are  solved  for  S  and  8  by  using  an  iterative  proce- 

A  A 

dure  and  the  estimates  o  and  8  are  obtained. 

The  Lognormal  Model.  The  maximum  likelihood  estimates  of  p  and 


2 

a  are: 


1  N 

5-S.I  h 

1=1 


N  2 

N 

2 

l  Of.)2  -  N 

I  VN 

-2  i=l  1 

i=l  1 

u  N 

(2.6) 


where  Y.  =  K.n  X. 

l  l 


The  Inverse  Gaussian  Model.  The  maximum  likelihood  estimates  are 


i  N 
^  1  r 

“  ‘  5  l  xi  ' 
1=1 


r1  =  I  y 

A  N 

1=1 


N 


(2.7) 


Xi  X 


2.1  Nonparametric  Estimation 

In  a  recent  paper,  Proschan  presents  a  method  for  estimating  the 
survival  function.  The  method  is  applicable  to  both  censored  and  uncen¬ 
sored  life  data. 

The  method  first  estimates  the  average  "failure  rate"  on  each 
interval  between  successive  failures.  Then  a  survival  function  is  ob¬ 
tained  for  each  interval,  and  the  separate  pieces  are  joined  to  form  a 
continuous  function.  The  continuous  function  gives  the  estimated  values 
of  the  survival  function,  including  the  point  at  which  last  failure  is 
observed.  It  does  not  provide  any  estimate  of  the  survival  function 


beyond  the  last  observed  failure. 


The  failure  rate,  tt(x)  ,  is  defined 


tt(x) 


f(x)  =  f (x) 

1  '  F«  '  F(x)  ' 


where  F(x)  is  the  distribution  function  of  the  random  variable  X  , 
the  time  to  failure  (or  cycles  to  failure),  and  f(x)  is  its  probability 
density  function. 

The  failure  rate  has  a  probabilistic  interpretation,  namely, 

7T(x)dx  represents  the  probability  that  a  device  of  age  x  will  fail  in 
the  interval  (x,  x+.-Jx)  [see  Mann,  Schafer,  and  Singpurwalla  (1974)]. 

In  Proschan  (1982),  an  estimate  of  the  failure  rate  in  the  inter¬ 
val  (z.  , ,  z.]  ,  where  z.  and  z.  ,  denote  the  time  of  the  ith  and 
l-l  l  l  l-l 

(i-l)st  observed  failures,  is  given  as 


77 i  observed  total  time  on  test  *  (  •  ) 

For  our  purposes,  the  definition  of  total  time  on  test  will  be  given 

for  the  case  of  uncensored  samples.  In  Proschan,  it  is  defined  for  the 

general  case,  where  the  withdrawals  are  taken  into  account. 

For  uncensored  data,  the  observed  total  time  on  test  is  defined  as 

TTOT .  =  (z.  -  z.  .)  +  (N-i) (z.  -  z.  _) 
l  l  l-l  l  l-l 


(2.9) 


=  (N-i+l)(zi  -  z._1)  ,  zQ  =  0  , 

where  N  denotes  the  number  of  items  on  test. 

By  using  (2.8)  the  failure  rates  on  successive  intervals  can  be 

A 

estimated.  Once  all  the  estimates  are  obtained,  we  can  use  F  ,  the 


estimator  of  the  survival  function  given  by  Proschan  (1982)  as 


F 


<exp{-TT^x},  0  <  x  <  z^ 

exp{-[n1z1  +  tt2(x-z1)]}  ,  z1  <  x  <  z2 

exp{-[7T1Z1  +  TT2(z2-Z1)  +  ...  +  7Ti(zjL  -  Z._1>  +  TK^X-Z.^}  , 

zi  <  x  ^  zi+1  ;  1  =  i»2,...,n-i 


As  stated  before,  there  is  no  estimator  for  x  >  z1T  . 

N 


(2.10) 


2.2  Application  of  the  Method 

In  this  section  the  results  of  Section  2.1  are  applied  to  the 
data  presented  in  Tables  A.l  and  A. 2  of  the  Appendix  and  the  nonparamet- 
ric  survival  function  is  estimated.  The  nonparametric  survival  function 
is  then  compared  with  the  survival  functions  estimated  assuming  different 
life  models. 


2.2.1  Analysis  of  the  first  set  of  data 

The  nonparametric  estimation  method  is  applied  to  the  data  pre¬ 
sented  in  Table  A.l.  The  application  of  Equation  (2.8)  to  successive 
intervals  (0,z^],  (z^,z2J,  ...»  (z^.z^j  gives  us  the  estimates  of  the 
failure  rate  on  each  interval.  The  estimated  failure  rates,  tt. 's  , 

l 

i  =  1,2,..., 6  ,  are  presented  in  Table  1.  The  estimate  of  the  survival 

A 

function,  F  ,  is  obtained  using  Equation  (2.10).  The  estimated  survival 

A 

function,  F  ,  is  presented  in  Figure  1. 

A  comparison  of  the  nonparametric  survival  function  with  the 
survival  functions  obtained  from  different  life  models  is  a  way  of 
checking  the  appropriateness  of  the  life  model  assumed. 


v 


- 


Table  1:  Estimated  Failure  Rates 


Flight  Hours  to  Failure 
z . 

l 

Estimated  Failure  Rate 

A 

*i 

Failure  Rate  Interval 
(zi-r  zil 

780 

0.00021 

(0, 

780] 

820 

0.00500 

00 

O 

820] 

910 

0.00278 

(820, 

910] 

950 

0.00833 

(910, 

950] 

1050 

0.02000 

(950, 

1050] 

The  estimation  of  the  survival  functions  assuming  different  life 
models  has  already  been  discussed.  The  maximum  likelihood  estimates  of 
the  parameters  of  the  models  and  the  estimated  survival  functions  are 
presented  in  Tables  2  and  3,  respectively. 

The  comparison  of  the  columns  of  Table  3  shows  that  the  lognormal 
(column  3)  and  the  inverse  Gaussian  (column  4)  survival  functions  behave 
similarly,  whereas  the  Weibull  survival  function  (column  2)  differs 
slightly  from  the  other  two.  A  comparison  of  these  three  models  with 
the  nonparametric  survival  function  is  made  in  Figures  2  and  3.  An 
inspection  of  the  figures  indicates  that  the  three  models  seem  equally 


Table  2:  Maximum  Likelihood  Estimates  of  the 
Parameters  of  the  Weibull,  the  Lognormal,  and 
the  Inverse  Gaussian  Distributions 


Weibull  Lognormal  Inverse  Gaussian 


6  =  10.54 

U  =  6.83 

V  =  926.67 

A 

~2 

6  =  972.93 

a  =  0.013 

X  =  72256.10 

Table  3:  Survival  Functions  for 
the  Weibull,  the  Lognormal,  and 
the  Inverse  Gaussian  Distribution 


(1)  (2)  (3)  (4) 

X  *(X)W  ^(X)LN  f(X)IG 


500 

550 

600 

650 

700 

750 

800 

850 

900 

950 

1000 

1050 


1.00 

0.99 

0.99 

0.98 

0.97 

0.94 

0.88 

0.79 

0.64 

0.46 

0.26 

0.11 


1.00 

1.00 

0.99 

0.99 

0.99 

0.96 

0.90 

0.77 

0.59 

0.41 

0.24 

0.13 


1.00 

1.00 

0.99 

0.99 

0.99 

0.97 

0.90 

0.78 

0.60 

0.41 

0.25 

0.14 


appropriate  for  describing  the  data  and  that  the  nonparametric  model  is 
more  conservative  than  the  others. 

2.2.2  Analysis  of  the  second  set  of  data 

The  data  which  will  be  analyzed  in  this  section  are  presented 
in  Table  A. 2  of  the  Appendix.  The  nonparametric  estimation  procedure  is 
applied  to  the  data.  The  estimated  failure  rates,  the  tt.'s  ,  are 

l 


presented  in  Table  4. 


F(x)  Probability 

of’  Survival 


900  1000  1100  X  Equivalent  Flight  Hours  to 

Crack  Initiation 


Hours  to 


11 


Table  4:  Estimated  Failure  Rates 


Cycles  to  Failure 
z . 

l 

Estimated  Failure  Rate 

/V 

7T. 

1 

Failure  Rate  Interval 
(zi-l*  zi] 

1066 

3.00005 

(0,  1066] 

1078 

0.00556 

(1066,  1078] 

1100 

0.00324 

(1078,  1100] 

1188 

0.00087 

(1100,  1188] 

1214 

0.00321 

(1188,  1214] 

1240 

0.00349 

(1214,  1240] 

1310 

0.00143 

(1240,  1310] 

1416 

0.00105 

(1310,  1416] 

1459 

0.00291 

(1416,  1459] 

1624 

0.00087 

(1459,  1624] 

1660 

0.00463 

(1624,  1660] 

1670 

0.02000 

(1660,  1670] 

1679 

0.02778 

(1670,  1679] 

1706 

0.01235 

(1679,  1706] 

1738 

0.01563 

(1706,  1738] 

1780 

0.02381 

(1738,  1780] 

The  nonparametric  survival  function  for  these  data  is  presented 
in  Figure  4. 

The  maximum  likelihood  estimates  of  the  parameters  of  the  three 
life  models  and  their  estimated  survival  functions  are  presented  in 
Tables  5  and  6,  respectively. 


Weibull 


Lognormal 


Inverse  Gaussian 


6  =  6.71 

S  =  1539.60 


U  =  7.25 
a1 2  =  0.033 


Vi  =  1433 
X  =  42983 


Table  6:  Survival  Functions  for 
the  Weibull,  the  Lognormal,  and 
the  Inverse  Gaussian  Distributions 


(1)  (2)  (3)  (4) 

X  F(*)u  FMln  F(x)ig 
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The  comparison  of  the  columns  of  Table  6  shows  results  similar 
to  those  obtained  in  Section  2.2.1.  A  comparison  of  the  life  models 
with  the  nonparametric  survival  function  is  made  in  Figures  5  and  6. 
These  two  figures  show  that  the  three  life  models  behave  similarly  and 
that  the  nonparametric  model  is  initially  more  conservative  than  the 
others. 

3.  ANOTHER  INFORMAL  GOODNESS-OF-FIT  TEST 
FOR  THE  INVERSE  GAUSSIAN  MODEL 

In  our  first  report,  a  distribution-free  Kolmogorov-Smirnov 

test,  assuming  the  parameters  were  known,  was  used  to  test  for  goodness 

of  fit  for  the  inverse  Gaussian  distribution.  It  was  stated  there  that 

a  formal  goodness  of  fit  test  for  this  distribution  was  not  known. 

However,  some  properties  of  the  inverse  Gaussian  distribution  and  its 

relation  with  the  normal  distribution  suggest  the  possibility  of  using 

the  formal  goodness-of-f it  tests  for  the  normal  distribution,  if  certain 

conditions  are  satisfied. 

3.1  Derivation  of  Inverse  Gaussian  Distribution 

The  derivation  of  inverse  Gaussian  distribution  is  discussed  in 
detail  in  Chhikara  and  Folks  (1974). 

If  a  random  variable  X  has  an  inverse  Gaussian  distribution, 
then  its  density  function  is 

,  '1 1/2  .  .2 

f(x;y,A)  =  - x  exp - -  ,  x  >  0  .  (3.1) 

2ttx  _  2y  x 

If  Y  is  another  random  variable,  such  that 


Number  of  Cycles  to  Crack  Initiation 


F(x)  jProbaljility 
lof  Survival 


Number  of  Cycles  to  Crack  Initiation 
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A  (x-y) 


(3.2) 


then  the  density  of  Y  can  be  obtained  as: 


g(y;A/y)  =  l  - 


J_  e-y  /2 


(3.3) 


/4X/y+y^ 


where  <  y  < 


As  Chhikara  and  Folks  (1975)  state,  the  distribution  function  of 


Y  is: 


F(y)  =  $(y) 


+  e2A/M  $  _  -  A\/\i  +  y2  )  , 


(3.4) 


where  $(•)  denotes  the  standard  normal  distribution  function.  We  note 
that  if  A /y  •*■  <»  ,  then  (3.4)  becomes  the  standard  normal  distribution 
function.  The  transformation  (3.2)  is  one-to-one;  therefore,  the 
survival  function  of  X  can  be  obtained  as  in  (2.3). 

Thus,  if  the  ratio  A/y  gets  large,  we  can  use  some  goodness 
of  fit  tests  that  are  available  for  the  normal  distribution. 


3.2  Application  to  the  Data 

For  the  first  set  of  data  which  was  analyzed  in  Section  2.2.1, 
the  maximum  likelihood  estimates  of  X  and  y  were  presented  in  Table 

A  A 

2.  For  this  data  set  the  ratio  X/y  is  approximately  equal  to  78.  For 


this  value,  the  term  $  |-  /4A/y  +  y  j  -*•  0  in  (3.4)  and  the  distribu¬ 
tion  function  of  Y  becomes  <t>(y)  •  Thus,  we  can  use  one  of  the  good¬ 
ness  of  fit  tests  for  the  normal  distribution. 

The  Anderson-Darling  goodness-of-f it  test  has  the  statistic 


A  =  -  l  (2i-l)  [Jin  F(x.)  +  Jtn[l  -  F(xn+1_.)]]  /  n  -  n  . 


For  the  first  set  of  data  the  test  statistic  is  computed  as  A  =  0.33 
which  requires  the  acceptance  of  the  null  hypothesis  that  the  data  are 
from  the  inverse  Gaussian  distribution. 

For  the  second  set  of  data  which  was  analyzed  in  Section  2.2.2, 

A  A 

the  ratio  A/u  is  approximately  equal  to  30.  For  this  value, 

<J>(-/4A/u+y2)  0  in  (3.4)  and  the  distribution  function  of  Y  becomes 

standard  normal.  Thus,  we  can  use  the  Anderson-Darling  test  for  the 

2 

second  set  of  data.  The  test  statistic  is  computed  as  A  =  0.76  , 
which  requires  the  acceptance  of  the  null  hypothesis  that  the  sample  is 
from  the  inverse  Gaussian  distribution. 

4.  CONCLUSION 

The  application  of  the  nonparametric  estimation  approach  to  the 
two  data  sets  indicates  that  the  nonparametric  model  behaves  more  con¬ 
servatively  than  the  parametric  life  models  considered  in  the  analysis. 
The  Weibull,  lognormal,  and  inverse  Gaussian  models  behave  similarly  and 
therefore  they  are  equally  appropriate  as  life  models. 

The  results  obtained  in  Section  3  indicate  that  the  formal  good¬ 
ness  of  fit  tests  for  the  normal  distribution  can  be  used  for  the  in¬ 
verse  Gaussian  model  if  certain  conditions  are  satisfied. 

On  the  basis  of  the  present  analysis,  it  can  be  concluded  that  the 
nonparametric  model  which  behaves  more  conservatively  than  the  others  may 
be  considered  as  an  alternative  model  for  describing  the  distribution  of 
the  number  of  cycles  to  crack  initiation. 
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TABLE  A. 2 


CYCLES  TO  CRACK  INITIATION 
IN  BOLT-HOLES 


Bolt-hole 

Number 

Sawtooth 

Cycles 

1 

1214 

2 

1188 

3 

1240 

4 

1078 

5 

1066 

6 

1100 

7 

1670 

8 

1660 

9 

1679 

10 

1310 

11 

1459 

12 

1624 

13 

1738 

14 

1706 

15 

1416 

16 

1780 

END 
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